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(57) Abstract 



A transcoder has a decoder fo decoding a received video signal which has been coded according to a first coding scheme employing 
motion compensation and carries coded data and motimi compensation information, and an encoder for encoding the output of the decoder 
according to a second coding scheme also employing motion compensation. The transcoder generates estimated motion vectors for a current 
frame of the video signal, using vectors which, in die received signal, accompany at least one other frame of the video signal. These may 
be used directly by the encoder or used to define a search area for motion estimation. 
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TRANSCODING 

This invention relates to a method and apparatus for transcoding video 
signals from a first format into a second format. 
5 Many techniques for compression of video signals are known. Broadcast 

quality television requires in excess of 100 Mbit/s when transmitted in digital form, 
which is expensive to transmit and requires links of high bandwidth. Known 
compression coding techniques may be used which take advantage of the high 
degree of spatial and/or temporal redundancy in the video signals being encoded. 

10 So, for example, in videoconferencing applications, compression down to a bit rate 
of a few hundred kilobits per second is possible whereas videotelephone quality 
pictures Including sound can be compressed down to only 64 kilobits per second, 
equivalent to a single telephone channel. 

One known compression technique is predictive coding which exploits the 

15 assumption that a picture element (pixel) within a frame is related to neighbouring 
pixels in the same frame (spatial prediction) or a different frame (temporal 
prediction) and that the value of a pixel may therefore be predicted at a receiver 
instead of full information about the pixel being transmitted. It is only necessary to 
transmit the prediction error arising from such an assumption. For instance the 

20 first pixel of a frame may be transmitted exactly with each subsequent pixel being 
transmitted as the difference between it and Its predecessor. 

To reduce further the amount of information needed to be transmitted, a 
technique known as motion compensation may be used in which a picture is 
divided Into blocks of pixels and each block of a current frame is compared with a 

25 corresponding block of a reference frame, which may be a previous or a 
subsequent frame, and with shifted positions of that block, and the region of the 
reference frame which the block most closely resembles is identified. The vector 
difference in position between the identified region and the block in question is 
termed a motion vector and is used to shift the identified region of the reference 

30 frame into the position of the relevant block in the current frame. Motion vectors 
are generated for most or .all the blocks of a current frame and are used to derive a 
predicted frame from the reference frame(s). The differences between the current 
and predicted frame are, on average, smaller than those between the current and 
reference frame and can be encoded using less data. A decoder which already has 
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the reference frames stored can thus reproduce the current frame using the motion 
vectors and the difference values. A signal may be coded using any of the 
aforementioned coding techniques either separately or in combination. 

There are circumstances when it is desirable to employ a transcoder which 
5 receives a signal encoded according to a first coding scheme and outputs an 
encoded data stream encoded according to a second scheme. If one has a 
decoder which operates according to the second coding scheme then such a 
transcoder would allow the decoding of a signal encoded according to the first 
coding scheme without modifying the original encoder or the ultimate decoder 
10 Known transcoders generally decode a signal encoded according to a first 

coding scheme into an uncompressed signal which is then encoded by an encoder 
according to the second coding scheme to output a new data stream. Thus a full 
decoding operation is carried out to reconstitute the original video signal and then 
this video signal is encoded to provide a new coded data stream according to the 
15 second coding scheme. For coding methods involving motion compensation new 
motion vectors are generated for the signal encoded according to the second 
coding scheme and this accounts for a large proportion of the processing time of 
- conventional transcoders. Transcoders are generally assumed not to exist in a 
transmission path i.e. a video signal encoded according to the H.261 standard is 
20 assumed to be received by a decoder conforming to the H.261 standard: the 
introduction of a transcoder into the transmission path will introduce delay into the 
path which delay may be unacceptable. 

International patent application W095/29561 describes a transcoder 
which receives a signal encoded according to a first coding scheme employing 
25 motion compensation and outputs a signal encoded according to a second coding 
scheme which also employs motion compensation, the motion vectors being 
extracted from the received video signal and transferred to the output signal. 
Thus, it is not necessary to recalculate the motion vectors for the second coding 
scheme. However this shceme presupposes that the received signal contains 
30 motion vectors suitable for use in the second coding scheme. 

According to the present invention there Is provided a transcoder 
comprising a decoder for decoding a received video signal coded according to a 
first coding scheme, and an encoder for reencoding the signal according to a 
second coding scheme, said coding schemes being such that at least some frames 
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are coded using motion-compensated inter-frame predictive coding based on a 
reference frame which is not the same in the second coding scheme as it is in the first 
coding scheme, wherein that the transcoder includes motion vector processing means 
to generate an estimated motion vector for a current frame of the video signal, the 
5 motion vector processing means being connected to receive, when processing vectors 
for a current frame, vectors which, in the received signal, accompany at least one 
other frame of the video signal. 

In another aspect, the invention provides a transcoder comprising a 
decoder for decoding a received video signal , said received signal being coded 

10 according to a first coding scheme employing motion compensation and comprising 
coded data and motion compensation information, and an encoder for encoding the 
output of the decoder according to a second coding scheme employing motion 
compensation, the encoder including motion estimation means for generating 
motion compensation information, wherein the motion estimation means is 

15 arranged to receive motion compensation information obtained from the received 
signal and to carry out motion estimation within a search area centred on the 
received motion compensation information, wherein, for use where at least some 
frames are coded using motion-compensated inter-frame predictive coding based on a 
reference frame which is not the same in the second coding scheme as it is in the first 

20 coding scheme, the transcoder includes motion vector processing means to generate 
an estimated motion vector for a current frame of the video signal, the motion vector 
processing means being connected to receive, when processing vectors for a current 
frame, vectors which, in the received signal, accompany at least one other frame of the 
video signal, and the motion estimation means being connected to receive the said 

25 estimate. 

In a further aspect; the invention provides a method of transcoding a 
received video signal coded according to a first coding scheme into a signal 
according to a second coding scheme, said coding schemes being such that at 
least some frames are coded using motion-compensated inter-frame predictive coding 
30 based on a reference frame which is not the same in the second coding scheme as it 
is in the first coding scheme, characterised by processing received motion vectors to 
generate an estimated motion vector for a current frame of the video signal, the 
processing step being responsive, when processing vectors for a current frame. 
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vectors which, in the received signal, accompany at least one other frame of the video 
signal. 

In yet another aspect of the invention there is provided a method of 
transcoding a signal representing a two-dimensional array, the method comprising 
5 decoding a received signal encoded according to a first coding scheme employing 
motion compensation, said signal comprising coded data and motion compensation 
information, and encoding the decoded output according to a second coding 
scheme employing motion compensation, the encoding including generating motion 
compensation information by carrying out motion estimation within a search area 

10 centred on motion compensation information obtained from the received signal, 
wherein, for use where at least some frames are coded using motion-compensated 
inter-frame predictive coding based on a reference frame which is not the same In the 
second coding scheme as it is in the first coding scheme, the method includes the step 
of motion vector processing to generate an estimated motion vector for a current frame 

15 of the video signal, the processing step being responsive, when processing vectors for 
a current frame, vectors which, in the received signal, accompany at least one other 
frame of the video signal, and the motion estimation serving to perform estimation 
within a search area centred on a position determined by the said estimate. 

Other, preferred, aspects of the invention are set out in the sub-claims. 

20 The invention will now be described further by way of example with 

reference to the accompanying drawings, in which: 
Figure 1 shows a known transcoder; 

Figure 2 shows the capture, coding and display order for video signals 
conforming to MPEG and H.261 coding standards; 
25 Figure 3 shows a second embodiment of a transcoder according to the 

invention; 

Figure 4 shows a portion of a frame store and illustrates the motion vector 
generation. 

A transcoder is used to convert signals encoded according to a first format 
30 into signals encoded according to a second format. Figure 1 shows a known form 
of transcoder which is arranged to convert video signals coded at a particular bit 
rate (e.g. 64k bit/s) conforming to the H.261 standard to video signals coded at a 
lower rate (e.g. 32k bit/s) conforming to the H.261 standard. Clearly the 
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transcoder may, in practice, be arranged to convert signals from and to other 
formats. 

The decoder part of the transcoder shown in Figure 1 comprises a de- 
multiplexer 1 which receives an incoming coded data stream conforming to the 
5 H.261 standard and de-multiplexes the data stream into its constituent parts of 
compressed video data and motion vectors. The compressed video data is then 
decoded by a variable length decoder (VLD) 2, and then passes to an inverse 
quantiser 4 which outputs values of the discrete cosine transform (DCT) 
coefficients. The DCT coefficients are then transformed back into the pixel domain 

10 by an inverse discrete cosine transform (IDCT) unit 6 to produce a video signal in 
the pixel domain. This signal is then added by an adder 10 to the previous frame 
(if any) stored in a frame store 8 and the resulting predicted frame is stored in the. 
frame store 8. Upon receipt of the first frame, the uncompressed frame is stored 
in a.fraine store 8 - i.e. the second input to the adder 10 is zero. For subsequent 

15 frames, the decoded data represent a prediction error and are added to the 
contents of the frame store 8 by the adder 10. However, the frame store output is 
motion-compensated by a motion compensator 1 1 controlled by motion vectors 
from the demultiplexer 1 . 

The frame output by the decoder part is then output to a subtracter 

20 12 of the encoder part, which also receives as an input the output of a frame 
store 14 of the encoder part, which stores a decoded version of the previous- 
encoded frame of the encoding loop. Before the contents of the frame store 14 
are input to the subtracter 12, motion compensation is carried out by a motion 
compensation unit 15 on the contents of the frame store 14, under control of a 

25 motion estimator 24 which, for each block of a frame to be encoded, searches the 
frame store 14 in the vicinity of the block position to identify a region which best 
resembles the block: the vector offset between the block position and the region 
identified forms a motion vector for controlling the motion compensation unit 1 5. 

The output of the subtracter 1 2 is converted to discrete cosine transform 

30 coefficients by a discrete cosine transform (DCT) unit 1 6, quantised by a quantiser 
1 8 and variable-length coded by a coder 26 for transmission. The motion vectors 
calculated by the motion estimator 24 are multiplexed into the data stream by a 
multiplexer 27. A buffer 28 buffers the encoded data stream output by the 
multiplexer 27, to provide an output at the bit rate required by the transmission 
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medium. This output may have a constant bit rate or a variable one. To generate 
the contents of the frame store 1 4, the output of the quantiser 1 8 is decoded by 
an inverse quantiser 20 and inverse DCT unit 22 and added to the motion- 
compensated contents of the frame store 14 by an adder 23. 
5 One aspect of the transcoder which involves a significant amount of 

processing power is the operation of the motion estimator 24; thus motion vector 
estimation for an 8x8 block is typically carried out within a search area of ±8 or 
±16 pixels in both directions around the block, involving, even for the +8 case, 
some 289 calculations. Consequently, efforts have been made to simplify this by 

10 making use in some way of the motion vectors already present in the Incoming 
signal. Thus, our. international patent application WO 95/29561 describes a 
transcoder like that of Figure 1 but proposes to eliminate the motion estimation 
unit 24 by feeding the incoming motion vectors to the motion compensation unit 
1 5 and the multiplexer 27, either directly or - in the case where the pixel resolution 

15 of the incoming and outgoing video signals differ - after scaling by an appropriate 
factor. 

Another approach has been suggested in US patent 5,600,646. There it is 
suggested that the incoming motion vector be used as a "seed" for the search; by 
starting the search at the displacement given by the corresponding vector from the 

20 decoder, a much smaller search area is found to be sufficient. In the US patent, a 
search extent of +3 pixels is suggested, though we have found that a search area 
of ±1 pixel gives useful results. 

The manner of inter-frame differential coding employed in the H.261 
standard is straightforward in that each predictive coding process is based on the 

25 previous frame as a reference. However not all known coding systems do so - the 
MPEG standard, described further below, is one example. The present invention 
aims to make use of the incoming motion vectors in a situation where the 
transcoder operates between two coding standards where at least some frames 
are coded using motion-compensated inter-frame predictive coding based on a 

30 reference frame which is not the same in the one standard as it is in the other 
standard. The transcoder now to be described is designed for use in one such 
situation, viz. the conversion of an MPEG-coded signal to an H.261 -coded signal. 

Firstly, the format of an MPEG signal will be explained. It should be noted 
that when in this description we use expressions such as "earlier", "later". 
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''preceding" or "next", this refers (unless explicitly stated otherwise) to the order 
of the franaes in capture and display order (i.e. as would be output by a camera, or 
seen on a display nnonitor), irrespective of the actual order in which the frames are 
coded or transmitted. In MPEG, frames are coded in three distinct ways, resulting 
5 in three different types of coded frame. An intra-frame, or l-frame, is not coded by 
reference to any other frame (i.e. it does not employ inter-frame differential coding 
at all). A predicted frame (P-frame) is coded by motion-compensated inter-frame 
differential coding relative to an earlier frame. This earlier frame must be either an 
1-frame or a P-frame and is, in general, not the immediately preceding frame. The 

10 third type of frame is a bidirectional frame (B-frame). One or more B-frames may 
occur between an l-frame or P-frame and the next l-frame or P-frame; the actual 
number is not restricted by the standard but in practice there are always two. For 
the purposes of description, we will refer to a sequence of frames beginning and 
ending with an l-frame, but containing no intermediate l-frames, as a group of 

15 pictures; and to a sequence beginning and ending with an l-frame or P-frame as a 
subgroup of pictures (this terminology is not necessarily the same as that used in . 
the published standard). Note that using the terminology (except at the beginning 
and end of a transmission) an l-frame belongs to two groups of pictures and an I- 
or P-frame belongs to two subgroups of pictures. It follows that a group of 

20 pictures contains one or more sub-groups of pictures. In practice there are four 
types of sub-group: IBBI, IBBP, PBBl and PBBP. 

Each block within a B-frame is coded in one of four ways: 

(a) by motion-compensated inter-frame differential coding based on the the 
most recent past frame which is not itself a B-frame (a motion vector for forward 

25 prediction may be transmitted); 

(b) by motion-compensated inter-frame differential coding based on the next 
future frame which is not itself a B-frame (a motion vector for backward 
prediction may be transmitted); 

(c) by interpolation: a prediction of the block for the purpose of differential 
30 coding is formed by interpolation between the motion-compensated inter-frame 

prediction based on the most recent past frame which is not itself a B-frame and 
the motion-compensated inter-frame prediction based on the next future frame 
which is not itself a B-frame (a motion vector for forward prediction and a motion 
vector for backward prediction may be transmitted); 
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(d) without inter-frame predictive coding (an "intra-block"). 

The coder decides which of (a), |b) and (c) to use according to which gives 
the best prediction and hence the most economical coding. Intra-btocks Id) are 
used if the coder finds that none of the other options gives 'a useful prediction. 
5 These do not occur often in practice. Note also that this decision is made on a 
block-by-biock basis, so that a given B-frame will more often than not involve the 
use of more than one of these four methods. 

Because backward prediction is used for the B-frames, it is necessary to 
. code (and decode) the frames within a subgroup in a different order from the 
10 capture and display order, though the order of the subgroups in unaffected. Thus, 
for an IBBP subgroup the I-frame is coded first, followed by the P-frame, and then 
the B-frames. As this is the order of decoding, the frames are transmitted in this 
order (this is not actually necessary, but minimises delay). 

Figure 2a shows ten frames of a video signal, in capture and display order, 
1 5 numbered 0 to 9. Figure 2b shows these same frames referenced with a letter I, P 
or B to indicate the type of coding to be used, followed by the same number, and 
then by a subscript indicating the order in which the frames are processed by an 
MPEG coder or decoder. In this description the frames will be referred to as 10, 81 
etc. and the subscripts added only where this assists understanding. This Figure 
20 also shows the motion vectors diagrammatically by means of arrows, where the 
arrowhead points to the reference frame. Forward motion vectors are notated as 
Fi,j where the first index indicates which frame the vector belongs to and the 
second index identifies the reference frame: thus F4,3 is the vector for frame B4, 
relative to frame P3 as reference frame. Backward vectors are indicated by R - 
25 e.g. R4,6 is the vector for frame B4, relative to frame 16 as reference frame. 

Figure 3 shows a transcoder according to one embodiment of of the 
invention arranged to receive signals encoded according to ISO/IEC standard 
1 1 172-2 (commonly known as MPEG1) and to output signals encoded according 
to H. 261. 

30 The transcoder has a similar structure to that of Figure 1 , and components 

having substantially the same function are indicated using the same reference 
numerals. The decoder part is formed by items 1 to 1 1 , plus a frame reordering 
unit 30 which receives the decoded frames in the order of receipt - i.e. the order 
shown in Figure 2c - and outputs them in display and capture order - i.e. as shown 
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in Figure 2a (it contains two frame stores and read-write circuitry to read and write 
the frames in the required order). This is a conventional' MPEG decoder except 
that (for reasons which will be explained) the frames are delayed longer than would 
be usual: for this reason the reordering unit 30 is followed by a one-frame delay 
5 35. The timing of the frames incoming to the transcoder ("IN") and output from 
the reordering unit ("OUT") is shown in the following Table: 



IN 


0 




3 




1 




2 




6 




4 




5 




9 




7 




8 




etc 








OUT 












0 




1 




2 




3 




4 




5 




6 




7 




8 




9 



In other words the picture reordering unit 52 receives frames 0, 3, 1; 
outputs frame 0; stores the next decoded frame 2; outputs frame 1 ; stores the 
decoded frame 6; outputs frame 2; stores the decoded frame 4; and so on. 
10 Clearly if more P and B frames occur between each I frame, the picture reordering 
unit 52 will need to be arranged to store more frames of data. 

On the encoder side, items 12 to 28 form an H.261 encoder just like the 
one in Figure 1 , except that the motion estimation unit 24 is replaced by a motion 
estimation unit .31. 

15 Before proceeding with description of the remainder of Figure 3, we 

observe that the encoder is required to produce an H.261 signal, that is, for the 
sequence depicted in Figure 2a, a sequence as shown in Figure 2d, where each 
frame after the first is a predicted frame based on the immediately preceding frame 
as reference, and requires the generation of a motion vector relative to that 
20 immediately preceding frame. Looking at the motion vectors provided by the 
MPEG signal, we see that: 

lOo and I64 will contain no vectors as they are reference pictures. 
P97 may only contain vectors relative to I64. 

B45 and BSe may contain forward vectors from P3i, and backward vectors 
25 from I64. 

B78 and B89 may contain forward vectors from I64, and backward vectors 
from P97. 

When such a MPEG signal is transcoded to H.261, the set of forward and 
30 backward vectors which may relate to non-consecutive pictures must be converted 
to a set of forward vectors that relate from one picture to the next. For this 
purpose, the embodiment as shown in Figure 3 includes a motion vector processor 
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32 for deriving forward motion vectors MV'i from the forward and backward 
motion vectors MV,. 

The one-frame delay mentioned earlier serves to ensure that, at the time 
the encoder comes to process any particular frame, all the vectors which might be 
5 needed have in fact been received. The motion vector processor 32 is preceded 
by a buffer 33 which is sufficiently large to contain all the vectors for five 
consecutive frames of the incoming signal to ensure that, at the time the processor 
comes to estimate motion vectors for any particular frame, all the vectors which 
might be needed are still available. 

10 The processor 32 is a program-controlled processing unit which serves to 

examine the contents of the buffer 33 and to extract from it, for each block of 
each frame, one or^ more vectors, and to perform, where necessary, arithmetic 
operations upon it or them so as to form at least one estimated motion vector for 
use in coding the frame under consideration according to the H.261 standard. 

15 There are four types of vector derivation that may be used. The first of these is 
the forward vector. Since the objective is to obtain a forward vector vyhich 
identifies for a frame j a region of the immediately preceding frame y-1 suitable for - 
prediction, the obvious candidate, if available, is the single step forward vector 
VFS from the MPEG frame. If, on the other hand, the MPEG frame carries a 

20 forward vector referencing a frame earlier by n frames (i.e. frame j-n), then this 
vector can be divided by n to give a vector usable for frame n or indeed for the 
intervening frame(s) j-1 ... j-n + 1. This is notated here generically as VFN or, for 
specific values of n as \/F2, \/F3 etc. 

A second type of derivation takes note of the fact that if the MPEG signal 

25 contains a backward vector for a frame j-1 (or j-n) relative to frame j, then the 
negative of this (divided by n if need be) is a fair estimate VRS (or VRN) of the 
wanted vector. 

Thirdly, one may take the difference between two forward vectors to obtain 
a forward difference vector VRD; and fourthly the negative of the difference 
30 between two backward vectors (VRD). 

A mixed vector VMX, for example, for frame B78 = F9,6 + R7,9, may also be 
useful but is not included In the present version. 

Because none of the vectors shown in Figure 2b is actually guaranteed to 
occur in particular subgroup of pictures, one may also include the possibility of 
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using the vector derived for a nearby frame (notated as VFS(j-l) etc.). This nneans 
that the probability of no derived vector being available becomes very small: 
however, should this occur (and it will only occur if the block in question or the 
corresponding block in its (later) reference frame was coded as an intra-block) then 
5 that block can be coded as an intra- block in the outgoing H.261 signal. 

Most of these possibilities are set out in the Table below for blocks 16, B7, B8 and 



P9. 



Frame 


VFSA/FN 


VRSA/RN 1 


VFD 


VRD . 




1) VFS(j+1) 
= F7,6 

4) VF2{H) = 
y2F5,3 

6) VFS(i-2) =F4,3 
9) VF3(j + 3) = 
V3F9,6 


2) VRS(j-l) = -R5,6 

7) VR2(j:2) = 
-y2R4,6 

8) VR2(j + 1) = 
-y2R7,9 


3) VFD(h1) = 
F5,3-F4,3 


5) VRD(j-l) = 
- (R4,6-R5,6) 


B78 


1) VFS = F7,6 
3) VF2(j+1) = 
y2F8,6 

7) VF3(j + 2) = 
V3F9,6 


2) VR2 = -y2R7,9 
4) VRS(j + 1) = -R8,9 
9) VRS(j-2) = -R5,6 


5) VFD{j + 1) = 

F8,6-F7,6 

8) VFD{j + 2) - 

F9,6-F8,6 


6) VRD(j + 1) = 
- (R7,9-R8,9) 


B89 


1) VF2 = y2F8,6 

2) VFS(j-l) = 
F7,6 

6) VF3(j+1» = 
V3F9,6 


7) VR2(j-1> = 
-y2R7,9 

3) VRS = -R8,9 


4) VFD = F8,6- 
F7,6 

8) VFD(j+1) = 
F9,6-F8,6 


5) VRD = 
- (R7,9.R8,9) 


P9, 


5) VF3 = V3F9,6 

6) VF2(j.1) = 
y2F8,6 

7) VFS{j-2) = 
F7,6 

8) VFS(j+1) 
= F10,9 

11) VF3 {j + 3) = 
V3F12,9 


1) VRS(j-l) = -R8,9 

9) VR2(i-2) = 
-y2R7,9 

10) VR2{j + 1) = 
-y2R10,12 


2) VFD = F9,6- . 
F8,6 

3) VFD(j-l) = 
F8,6-F7,6 


4) VRD(j-l) = 
- (R7,9-R8,9) 
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By way of explaining some of these more fully: 

For B7e, the single forward vector VFS is simply its forward vector F7,6 
from I64. Its scaled backward vector is minus one half of its backward vector R7,9 
from P97. There is no forward difference vector and no backward difference 
5 vector. 

For B89, the scaled forward vector is half its forward vector F8,6 from I64. 
Its scaled backward vector is minus its backward vector from P97. The backward 
difference vector is the difference between its own backward vector and that of 
BTq. Its forward difference is the difference between its own forward vector and 
10. BTe's forward vector. 

For P97, the scaled forward vector is one third of its forward vector from 
I64. Its forward difference is the difference between P97's forward vector and 
BSg's forward vector. There is no scaled backward vector and no backward 
difference vector. 

15 For I64, there are no vectors: vectors for this frame must be derived from 

vectors for nearby frames. 

There are three possible modes of operation for this motion vector 

derivation (it being assumed that any given version of the apparatus uses one of 

these only):: 
20 (i) , hierarchical; • - 

(ii) evaluation; 

(iii) a combination of the two. 

In the hierarchical method, the processor 32 operates in accordance with a 
preferred order of the vector types. These depend on the frame type, and are 
25 shown by the numbering in the table, so for 16, the hierarchy might be: VFS(j + 1), 
VRS(j-l), VFD(j-l), VF2(j-l), VRD(j-l), VFS(j-2), VR2(j-2), VR2(J + 1) and VF3(j-f 3). 

Thus for 16, the processor reads VFS{j+ 1) from the buffer if it is available. 
If not, it reads VRS(j>1). If this is not available then it reads out the two vectors 
necessary to calculate VFD(j-1), and subtracts them to form the derived vector. 
30 And so on. 

In the evaluation method, the processor computes all the available 
candidate vectors for the current block and forwards them to the motion vector 
estimation unit 31 which then evaluates them by, for each vector, forming a 
predicted block using that vector (after a search as described below, if this option 
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is being used) and calculating a metric (such as the sum of absolute differences) 
between the predicted block and the block to be encoded. The vector to be 
chosen is the one giving the lowest metric. 

The combined method is the same as the evaluation method, except that 

5 the number of vectors to be evaluated is limited to the first few (perhaps three) 
available vectors, considered in the order prescribed by the list. 

It is possible, though non-optimum, to use the vector obtained in this way 
directly: in fact, if one does so, using the hierarchical method the motion vector 
evaluation unit 31 becomes just a through-connection. We prefer, however, to use 

0 the vector to establish an offset for the motion vector estimation unit 31 to 
perform a search around a small area around the position established by the 
vector. Thus the the motion vector estimation unit 31 carries out motion 
estimation in a search area, the location of which is determined by the received 
motion vectors. 

15 This is illustrated in Figure 4, which shows a portion of the frame store 

14, each square 20 representing a picture element (or pixel). Conventionally, 

motion vectors are determined for blocks of pixels rather than for each individual 

pixel; Figure 4 therefore shows a block 22 of 4 x 4 pixels. For a prior art 

transcoder, for instance as is described in W095/29561, the motion vectors would 
20 simply result in the prediction for the block currently under consideration being the 

region within the frame store determined by the received motion, vector MV, for 

the block, before the subtracter 12 carried out its operation. 

In a transcoder according to this version of the invention however, further 

motion estimation is carried out. The motion vector MV', from the processor 32 
25 initiates a search in the frame store 14 around a position offset from the current 

block under consideration by an amount represented by the motion vector 

associated with the current block. 

For instance, suppose that the motion vector MV'i received for a current 

block (shown is solid lines in Figure 4) at the position A in a current picture 
30 specifies a shift from the the position A to the position B shown in double lines. 

Further motion estimation is then carried out in a search area 26 around position B 

(indicated by dotted lines). 

Conventionally, according to the H.261 or the MPEG standard, motion 

estimation is carried out for a search area of ± 8 or ±16 pixels around the block. 
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Although a full motion estimation search may be carried out, it has been found that 
a severely limited search in the region ± 1 pixel (as shown by the search area 26) 
gives significantly improved results compared to the arrangement described in 
W095/29561. Such a restricted search area means that the number of 
calculations required to determine a motion vector for the second coding scheme is 
reduced (for a 8 x 8 block and a search area of ± 8 pixels) from 289 calculations 
to 9 calculations. The calculated motion vector MV2 is then added to the motion 
vector MV'i to form a new motion vector MV3 for the block. Once a new motion 
vector MV3 is calculated the block of the frame store 14 centred on the new 
motion vector MV3 is output to the subtracter 1 2 and the motion vectors MV3 are 
input to the multiplexer 27. 

Although the transcoder described, is designed to transcode MPEG signals 
into H.261 signals, the principle is applicable to other situations where at least 
some frames are coded using motion-compensated inter-frame predictive coding 
based on a reference frame which is not the same in the one standard as it is in the 
other standard. For example the transcoder of Figure 3 could be configured for the 
reverse conversion. If the motion vectors for the H.261 frames shown in Figure 2d are 
denoted by F1,0, F2,1, F3,2 etc., then estimates F'iJ of the motion vectors required to 
construct an MPEG signal can be constructed as follows: 

F7,6 = F7.6 

F'8,6 = F8,7 -f F7,6 

F'9,6 = F9,8 + F8,7 + F7,6 

R'7,9 = -(F9,8 + F8,7) 

R'8,9 = -F9,8. 

In the event that one or more vectors is absent from the incoming signal, 
vector estimates for non-consecutive frame prediction might be generated by 
multiplying the incoming vectors, e.g.: 

F'8,6 = 2 * F8J 

F'8,6 = 2 * F7,6 

F'9,6 = 3 * F9.8 

F'9,6 = 3 * F8,7 

F'9,6 = 3 * F7,6 

R'7,9 = -2 * F9,8 

R'7,9 = -2 * F8J, 



wo 99/291 13 



PCT/GB98/03553 



15 . , 

CLAtMS 

1. A transcoder comprising a decoder for decoding a received video signal 
5 coded according to a first coding scheme, and an encoder for reencoding the signal 
according to a second coding scheme, said coding schemes being such that at 
least some frames are coded using motion-compensated inter-frame predictive coding 
based on a reference frame which is not the same in the second coding scheme as it 
is in the first coding scheme, characterised in tfiat the transcoder includes motion 
10 vector processing means (32) to generate an estimated motion vector for a current 
frame of the video signal, the motion vector processing means being connected to 
receive, when processing vectors for a current frame, vectors which, in the received 
signal, accompany at least one other frame of the video signal. 

15 2. A transcoder comprising a decoder for decoding a received video signal, 
said received signal being coded according to a first coding scheme employing 
motion compensation and comprising coded data and motion compensation 
information, and an encoder for encoding the output of the decoder according to a 
second coding scheme employing motion compensation, the encoder including 

20 motion estimation means (31) for generating motion compensation information, 
wherein the motion estimation means is arranged to receive motion compensation 
information obtained from the received signal and to carry out motion estimation 
within a search area centred on the received motion compensation information, 
ctiaracterised in that, for use where at least some frames are coded using motion- 

25 compensated inter-frame predictive coding based on a reference frame which is not 
the same in the second coding scheme as it is in the first coding scheme, the 
transcoder includes motion vector processing means (32) to generate an estimated 
motion vector for a current frame of the video signal, the motion vector processing 
means being connected to receive, when processing vectors for a current frame, 

30 vectors which, in the received signal, accompany at least one other frame of the video 
signal, and the motion estimation means (31) being connected to receive the said 
estimate. 
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3. A transcoder according to claim 2 wherein the signals represent a video 
image divided into blocks of picture elements, wherein the motion estimation 
means is arranged to carry out motion estimation on a block-by-block basis. 

5 4. A transcoder according to any one of the preceding claims, including a 
buffer (33) for buffering the received motion vectors and delay means (35) for 
delaying the video signals prior to reencoding. 

5. A transcoder according to any one of the preceding claims in which the 
10 estimated motion vectors generated by the motion vector processing means include: 

at least one motion vector which is the reverse of the motion vector from a frame of the 
received signal other than the current frame. 

6. A transcoder according to any one of the preceding claims in which the 
1 5 estimated motion vectors generated by the motion vector processing means include: 

at least one motion vector for previous-frame prediction which is a scaled version of 
the motion vector from a frame of the received signal which involved prediction from a 
non-consecutive earlier frame. 

20 7. A transcoder according to any one of the preceding claims in which the 
estimated motion vectors generated by the motion vector processing means include: 
at least one motion vector which is an arithmetic combination of motion vectors from a 
plurality of frames of the received signal. 

25 8. A transcoder according to any one of the preceding claims in which the 
motion vector processing means (32) is operable to generate one of more of a 
plurality of possible estimated motion vectors in accordance with a predetermined 
hierarchy. 

30 9. A transcoder according to any one of the preceding claims in which the 
motion vector processing means (32) is operable to generate a plurality of 
candidate estimated motion vectors and the or a motion estimation means (31) is 
operable to evaluate the candidate estimated motion vectors and to select one in 
accordance with a predetermined criterion. 



wo 99/29113 



PCT/GB98/03553 



17 

10. A transcoder according to claim 2 or any one of claims 3 to 9 when 
dependent on claim 2 wherein the motion estimation means (32) is arranged to 
carry out motion estimation within a search area of |n±1)x|m±1)picture elements 

5 centred on the estimated motion vector. 

11. A transcoder according to any any one of the preceding claims wherein 
the two coding schemes employ different orders of transmission of the frames of 
the video signal ahd the transcoder further includes re-ordering means 130) for re- 

10 ordering the received signal. 

12. A method of transcoding a received video signal coded according to a 
first coding scheme into a signal according to a second coding scheme, said coding 
schemes being such that at least some frames are coded using motion-compensated 

1 5 inter-frame predictive coding based on a reference frame which is hot the same in the 
second coding scheme as it is in the first coding scheme, characterised by processing 
received motion vectors to generate an estimated motion vector for a current frame of 
the video signal, the processing step being responsive, when processing vectors for a 
cun^ent frame, vectors which, in the received signal, accompany at least one other 

20 frame of the video signal. 

13. A method of transcoding a received video signal representing a two- 
dimensional array, the method comprising decoding a received signal encoded 
according to a first coding scheme employing motion compensation, said signal 

25 comprising coded data and motion compensation information, and encoding the 
decoded output according to a second coding scheme employing motion 
compensation, the encoding including generating motion compensation iriformation 
by carrying out motion estimation within a search area centred on motion 
compensation information obtainwed from the received signal, characterised in 

30 that, for use where at least some frames are coded using motion-compensated inter- 
frame predictive coding based on a reference frame which is not the same in the 
second coding scheme as it is in the first coding scheme, the method includes the step 
of motion vector processing to generate an estimated motion vector for a cunrent frame 
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of the video signal, the processing step being responsive, when processing vectors for 
a current frame, vectors which, in the received signal, accompany at least one other 
frame of the video signal, and the motion estimation serving to perform estimation 
within a search area centred on a position determined by the said estimate. 

5 

14. A method of transcoding according to claim 14 wherein the signals/ 
represent a video image divided into blocks of picture elements, and the motion 
estimation is carried out on a block-by-b)ock basis. 

TO 15. A method according to any one of claims 12 to 14, including buffering the 
received motion vectors and delaying the video signals prior to reencoding.' 

16. A method according to any one of claims 12 to 15 in which the estimated 
motion vectors generated by the motion vector processing include: 

1 5 at least one motion vector which is the reverse of the motion vector from a frame of the 
received signal other than the current frame. 

17. A method according to any one of claims 12 to 16 in which the estimated 
motion vectors generated by the motion vector processing include: 

20 at least one motion vector for previous-frame prediction which is a scaled version of 
the motion vector from a frame of the received signal which involved prediction from a 
non-consecutive earlier frame. 

18. A method according to any one of claims 12 to 17 in which the estimated 
25 motion vectors generated by the motion vector processing include: 

at least one motion vector which is an arithmetic combination of motion vectors from a 
plurality of frames of the received signal. 

19. A method according to any one of claims 12 to 18 in which the motion 
30 vector processing generates one of more of a plurality of possible estimated 

motion vectors in accordance with a predetermined hierarchy. 



20. A method according to any one of claims 12 to 19 in which the motion 
vector processing generates a plurality of candidate estimated motion vectors and 
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the method includes motion estimation to evaluate the candidate estimated motion 
vectors and to select one in accordance with a predetermined criterion. 

21 . A method of transcoding according to claim 13 or any one of claims 14 to 
5 20 when dependent on claim 13 wherein the motion estimation is carried out 
within a search area of (n±1)x(m±1) picture elements centred on the estimated 
motion vector. 



10 



22. A method according to any of claims 12 to 21 wherein the two coding 
schemes employ different orders of transmission of the frames of the video signal 
and the method further includes re-ordering the decoded received signal. 
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